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In games with a large number of players where players may have overlapping objectives, the analysis 
of stable outcomes typically depends on player types. A special case is when a large part of the player 
population consists of imitation types: that of players who imitate choice of other (optimizing) types. 
Game theorists typically study the evolution of such games in dynamical systems with imitation rules. 
In the setting of games of infinite duration on finite graphs with preference orderings on outcomes for 
player types, we explore the possibility of imitation as a viable strategy. In our setup, the optimising 
players play bounded memory strategies and the imitators play according to specifications given by 
automata. We present algorithmic results on the eventual survival of types. 



1 Summary 

Imitation is an important heuristic studied by game theorists in the analysis of large games, in both 
extensive form games with considerable structure, and repeated normal form games with large number 
of players. One reason for this is that notions of rationality underlying solution concepts are justified by 
players' assumptions about how other players play, iteratively. In such situations, players' knowledge 
of the types of other players alters game dynamics. Skilled players can then be imitated by less skilled 
ones, and the former can then strategize about how the latter might play. In games with a large number 
of players, both strategies and outcomes are studied using distributions of player types. 

The dynamics of imitation, and strategizing of optimizers in the presence of imitators can give rise to 
interesting consequences. For instance, in the game of chess, if the player playing white somehow knows 
that her opponent will copy her move for move then the following simple sequence of moves allows her 
to checkmate her opponent Q: 

l.e3 e6 2.Qf3 Qf6 3.Qg3 Qg6 4.Nf3 Nf6 5.Kdl Kd8 6.Be2 Be7 7.Rel Re8 
8.Nc3 Nc6 9.Nb5 Nb4 10.Qxc7# 

On the other hand, we can have the scenario where every player is imitating someone or the other 
and the equilibrium attained maybe highly inefficient. This is usually referred to as 'herd behaviour' and 
has been studied for instance in Q. 

In an ideal world, where players have unbounded resources and computational ability, each of them 
can compute their optimal strategies and play accordingly and thus we can predict optimal play. But 
in reality, this is seldom the case. Players are limited in their resources, in computational ability and 
their knowledge of the game. Hence, in large games it is not possible for such players to compute their 
optimal strategies beforehand by considering all possible scenarios that may arise during play. Rather, 
they observe the outcome of the game and then strategise dynamically. In such a setting again, imitation 
types make sense. 

A resource bounded player may attach some cost to strategy selection. For such a player, imitating 
another player who has been doing extensive research and computation may well be worthwhile, even if 

'This is called 'monkey-chess' in chess parlance. 
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her own outcomes are less than optimal. What is lost in sub-optimal outcomes may be gained in avoiding 
expensive strategisation. 

Thus, in a large population of players, where resources and computational abilities are asymmetri- 
cally distributed, it is natural to consider a population where the players are predominantly of two kinds: 
optimisers and imitatorsH Asymmetry in resources and abilities can then lead to different types of imi- 
tation and thus ensure that we do not end up with 'herd behaviour' of the kind referred to above. Mutual 
reasoning and strategising process between optimizers and imitators leads to interesting questions for 
game dynamics in these contexts. 

Imitation is typically modelled in the dynamical systems framework in game theory. Schlag ( lfl2l ") 
studies a model of repeated games where a player in every round samples one other player according 
to some sampling procedure and then either imitates this player or sticks to her own move. He shows 
that the strategy where a player imitates the sampled player with a probability that is proportional to the 
difference in their payoffs, is the one that attains the maximum average payoff in the model. He also 
gives a simple counterexample to show that the naive strategy of 'imitate if better' may not always be 
improving. Banerjee (O) studies a sequential decision model where each decision maker may look at 
the decisions made by the previous decision makers and imitate them. He shows that the decision rules 
that are chosen by optimising individuals are characterised by herd behaviour, i.e., people do what others 
are doing rather than using their own information. He also shows that such an equilibrium is inefficient. 
Levine and Pesendorfer ([7 ]) study a model where existing strategies are more likely to be imitated than 
new strategies are to be introduced. 

The common framework in all of the above studies is repeated non-zero-sum normal form games 
where the questions asked of the model are somewhat different from standard ones on equilibria. Since 
all players are not optimizers, we do not speak of equilibrium profiles as such but optimal strategies for 
optimizers and possibly suboptimal outcomes for imitators. In the case of imitators, since they keep 
switching (imitate i for 2 moves, j for 3 moves, then again i for 1 move, etc.) studies consider stability 
of imitation patterns, what types of imitation survive eventually, since these would in turn determine play 
by optimizers and thus stable subgames, thus determining stable outcomes. Note that, as in the example 
of chess above, imitation and hence the study of system dynamics of this kind, makes equal sense in 
large turn based extensive form games among resource bounded players as well. 

For finitely presented infinite games the stability questions above can be easily posed and answered 
in automata theoretic ways, since typically bounded memory strategies suffice for optimal solutions, and 
stable imitation patterns can be analysed algorithmically. Indeed, this also provides a natural model for 
resource bounded players as finite state automata. 

With this motivation, we consider games of unbounded duration on finite graphs among players with 
overlapping objectives where the population is divided into players who optimise and others who imitate. 
Unbounded play is natural in the study of imitation as a heuristic, since 'losses' incurred per move may be 
amortised away and need not affect eventual outcomes very much. Imitator types specify how and who 
to imitate and are given using finite state transducers. Since plays eventually settle down to connected 
components, players' preferences are given using orderings on Muller sets ifTTll . In this work, we study 
turn-based games so as to use the set of techniques already available for the analysis of such games. 

In this setting we address the following questions and present algorithmic results: 

• If the optimisers and the imitators play according to certain specifications, is a global outcome 
eventually attained? 



There would also be a third kind of players, randomisers, who play any random strategy, but we do not consider such 
players in this exposition. 
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• What sort of imitative behaviour (subtypes) eventually survive in the game? 

• How worse-off are the imitators from an equilibrium outcome? 

Infinite two-player turn-based games on finite graphs have been extensively studied in the literature. 
A seminal result by Biichi and Landweber [4] showed that for Muller objectives, winning strategies exist 
in bounded memory strategies and can be effectively synthesised. Martin [ 8 ] showed that such games 
with Borel winning conditions are sure-determined (one of the players always has a winning strategy 
from every vertex). Zielonka |[T3l gave an insightful analysis of Muller games and provided an elegant 
algorithm to compute bounded memory winning strategies. 

For concurrent-move games, sure determinacy does not hold, and the optimal value determinacy (the 
values of both the players at every vertex sum to 1) for concurrent-move games with Borel objectives 
was proved in Q. Concurrent games with qualitative reachability and more general parity objectives 
have been studied in j2l [TJ- Such games have also been extended to the multiplayer setting where the 
objectives of the players are allowed to overlap. (21|6[ show that when the objectives are win-lose Borel, 
subgame perfect equilibria exist. [11] show that bounded memory equilibrim tuples exist in turn based 
games even when the objectives are not win-lose but every player has preferences over the various Muller 
sets. 

2 Games, Strategies and Objectives 

The model of games we present is the standard model of turn based games of unbounded duration on 
finite graphs. For any positive integer n, let [n] = {1,. . . ,«}. 

Definition 1 Let n£N,n>l. An n-player game arena is a directed graph 5f = (Vi, . . .V„,A,E), where 
Vi are finite sets of game positions with Vi n Vj = %for i ^ j, V = UieW ^ ^ ^ a fi n ^ te set of moves, and 
E C (V x A x V) is the move relation that satisfies the following conditions: 

1. For every v, Vi , V2 G V and a,b G A, if (y,a,v\ ) G E and (v,b, V2) G E then a^b. 

2. For every v GV, there exists ad A and V G V such that (v, a, v') G E. 

When an initial position vo G V is specified, we call (W,Vq) an initialised arena or just an arena. 

In this model, we assume for convenience that the moves of all players are the same. When v G Vj, 
we say that player i owns the vertex v. A game arena is thus a finite graph with nodes labelled by players 
and edges labelled by moves such that no two edges out of a vertex share a common label and there are 
no dead ends. For a vertex v G V, let vE denote its set of neighbours: vE = {v'|(v,a,v / ) G E for some 
a G A}. For v G V and a G A, let v[a] = {v'\(v,a,v') G E}; v[a] is either empty or the singleton {v'}. In 
the latter case, we say a is enabled at v and write v[a] = v' . For u G A*, we can similarly speak of u being 
enabled at v and define v[u] so that when v[u] = {v'}, there is a path in the graph from v to V such that u 
is the sequence of move labels of edges along that path. Given v G V and u G A*, if any w-labelled path 
exists in the graph, it is unique. On the other hand, given any sequence of vertices that correspond to a 
path in the graph, there may be more than one sequence of moves that label that path. 

A play in (^,Vq) is an infinite path vo -^V . . ., such that v,- a -^> v !+ i for i G N. We often speak of 
a^a\ ... G A m as the play to denote this path. The game starts by placing a token at vo G V*. Player i 
chooses an action a £ A enabled at vo and the token moves along the edge labelled a to a neighbouring 
vertex vi G Vj. Player j chooses an action a' £ A enabled at vi, the token moves along the edge labelled 
a' to a neighbouring vertex and so on. Note that since there are no dead ends, any player whose turn it is 
to move has some available move. 
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Given a path p = v vi . . . % Vk, we call a^, (1 < £ < k) the last /-move in p, if v^_i G V,- and for all 

£':£<£'<k, v e > $ V t . 

2.1 Objectives 

The game arena describes only legal plays, and the game itself is defined by specifying outcomes and 
players' preferences on outcomes. Since each play results in an outcome for each player, players' pref- 
erences are on plays. This can be specified finitely, as every infinite play on a finite graph settles down 
to a strongly connected component. 

For a play u G A m let inf(w) be the set of vertices that appear infinitely often in the play given by u. 
With each player i, we associate a total pre-order (2 V x 2 y ). This induces a total preorder on plays 
as follows: u -<i u' iff inf(w) -<i inf(V). 

Thus an ra-player game is given by a tuple (Sf,vo,^h,. . . ,^ n ), consisting of an « -player game arena 
and players' preferences. 

2.2 Strategies 

Players strategise to achieve desired outcomes. Formally, a strategy a, for player i is a partial function 

Oi'.VA* 

where ct,(vm) is defined if v[u] is defined and v[u] G V{, and if (v[m])[<7;(vk)] is defined. 

A strategy a, of player i is said to be bounded memory if there exists a finite state transducer FST 
£/ a = (M, 8,g,mo) where M (the 'memory' of the strategy) is a finite set of states, mo G M is the initial 
state of the memory, 8 : A x M — >■ M is the 'memory update' function, and g : Vi x M — >■ A is the 'move' 
function such that for all v G V,- and m G M, g(v,m) is enabled at v and the following condition holds: 
given v G Vi, when u = a\ . . .at G A* is a partial play from v, Oi(vu) is defined, <7,-(vm) = g(v[u],mk), 
where mk is determined by: m,+i = 5(a;+i,m,-) for < / < ^. 

A strategy is said to be memoryless or positional if M is a singleton. That is, the moves depend only 
on the current position. 

Definition 2 Given a strategy profile c = ((7i, ...,o n ) for n players let p^ denote the unique play in 
(5f,vo) conforming to 6. A profile o is called a Nash equilibrium in (Sf,vo, -<i, . . . , -< n ) if for every 
player i and for every other strategy o\ of player i, inf(p( 5 _. <i inf(p^). 

3 Specification of Strategies 

We now describe how the strategies of the imitator and optimiser types are specified. 
3.1 Imitator Types 

An imitator type is again specified by a finite state transducer which advises the imitator whom to imitate 
when using memory states for switching between imitating one player or another. When deciding not to 
imitate any other player, we assume that the type advises what to play using a memoryless strategy. 

An imitator type Ty for player j is a tuple (M, 71,11,8, mo) where M is the finite set denoting the 
memory of the strategy, mo G M is the initial memory, 8 : A x M — > M is the memory update function, 
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71 : V — > A is a positional strategy such that for any v G V, 71 (v) is enabled at v, and /I : M — > [n] is the 
imitation map. 

Given Zj as above, define a strategy a, for player j as follows. Let v£V and w = «i . . G A* is 
a partial play from v such that v[w] is defined and v[u] G Vj. Let = 5(a i+ i ,ra,-) for < i < Then 
<7/(vm) = ci£, if is the last jU(m#) move in the given play and is enabled at vu, and (7/(vw) = 7r(v[w]), 
otherwise. 

Note that the type specification only specifies whom to imitate, and how it decides whom to imitate 
but is silent on the rationale for imitating a player or switching from imitating x to imitating y. In general 
an imitator would have a set of observables, and based on observations of game states made during 
course of play, would decide on whom to imitate when. Thus imitator specifications could be given by 
a past-time formula in a simple propositional modal logic. With any such formula we can associate an 
imitation type transducer as defined above, so we do not pursue that approach here. See, for instance, 
ifTOl for more along that direction. 

The following are some examples of imitating strategies that can be expressed using such automata: 

1 . Imitate player 1 for 3 moves and then keep imitating player 4 forever. 

2. Imitate player 2 till she receives the highest payoff. Otherwise switch to imitating player 3. 

3. Nondeterministically imitate player 4 or 5 forever. 

For convenience of the subsequent technical analysis, we assume that an imitator type 
T = (M, 7V,H,d,mo) is presented as a finite state transducer M x = (M r , 8',g',mj) where 

• M' = V xMxAW. 

• 8' : A x M' — > M' such that 8' (a, (v,m, (a\,... ,a n ))) = {v',m', (ai, . .. ,a,_i ,a,a (+ i, . . . ,a n )) such 
that v A v', 8{a,m) = m' and v G V,. 

• g' : V xM' — >A such thatg'(v, (v,m, (a\ , . . . ,a n ))) =ai iff fl(m) = i anda,- is enabled at v. Otherwise 
g'{v, (v,m, (ai,..., a„)}) = 7F(v). 

• mi = (vo,mo, (at, . . . ,a n )) for some (ai, . . . ,a n ) G a'"L 

Figure 1 below depicts an imitator strategy where a player imitates player 1 for two moves and then 
player 2 for one move and then again player 1 for two moves and so on. She just plays the last move 
of the player she is currently imitating. Suppose there are a total of p actions, that is, |A| = p. She 
remembers the last move of the player she is imitating in the states m\ to m p , and when it is her turn to 
move, plays the corresponding action. 

Given an FST ^ T for an imitator type T, we call a strongly connected component of £% x a subtype of 
3& x . We will often refer to the strategy Oj induced by the imitator type £% x for player j as when the 
context is clear. 

We define the notion of an imitation equilibrium which is a tuple of strategies for the optimisers such 
that none of the optimisers can do better by unilaterally deviating from it given that the imitators stick to 
their specifications. 

Definition 3 In the game (W,vo,^,i,...,-< n ), given that the imitators r + l,,..,n play strategies 
T r+ i, . . . , x n , a profile of strategies 6 = (oj , . . . , <J r ) of the optimisers is called an imitation equilibrium if 
for every optimiser i and for every other strategy a- ofi, inf(p(o-_„<T/)) — i inf (p^)- 

Remark Note that an imitation equilibrium a may be quite different from a Nash equilibrium a' of the 
game (^,vq, -<i, . . . , -< n ) when restricted to the first r components. In a Nash equilibrium the imitators 
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Figure 1 : An imitator strategy 



are not restricted to play according to the given specifications unlike in an imitation equilibrium. In the 
latter case, the optimisers, in certain situations, may be able to exploit these restrictions imposed on the 
imitators (as in the example of 'monkey-chess' discussed in SectionQ")). 



3.2 Optimiser Specifications 

One of the motivations for an imitator to imitate an optimiser is the fact that an optimiser plays to get 
best results. To an imitator, an optimiser appears to have the necessary resources to compute and play 
the best strategy and hence by imitating such a player she cannot be much worse off. But what kind of 
strategies do the optimisers play on their part? 

In the next section, we show that if the optimisers know the types (the FSTs) of each of the imitators, 
then it suffices for them to play bounded memory strategies. Of course, this depends on the solution 
concept: Nash equilibrium is defined for strategy profiles, we need to particularize them for applying 
only to optimizers. 

Thus in the treatment below, we consider only bounded memory strategies for the optimisers. 



4 Results 

In this section, we first show that it suffices to consider bounded memory strategies for the optimisers. 
Then we go on to address the questions raised towards the end of Section Q] 

First we define a product operation between an arena and a bounded memory strategy. 

4.1 Product Operation 

Let (Sf , vo) be an arena and a be a bounded memory strategy given by the FST £/ a = (M, 8,g,mj). We 
define ^ X srf a to be the graph (&',v' ) where <S' = (V',E') such that 
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• V 



VxM 




Oo,m ) 

If g(v,m) is defined then (v,m) A (v',m') iff 8(a,m) = m',v —> V and g(v,m) = a. 
If g(y,m) is not defined then (v,m) A (v',m f ) iff 8(a,m) = m! and v A v' 



Proposition 1 Le? (Sf ,vo) &e arc arena and o be a bounded memory strategy. Then & x srf a is an arena, 
that is, there are no dead ends. 

Proof Let ,v' Q ) = & x srf a . S :AxM —■ M being a function, 8(a,m) is defined for every a G A and 
m G M. Also by the definition of < S, for every vertex v G V there exists an action a € A enabled at v and 
a vertex v'eF such that v A v'. Thus for every vertex (v,m) G V', 

• if g(v,m) is not defined then corresponding to every enabled action a G A there exists (v',m') G V' 
such that (v,m) A (v',m'), 

• if g(v,m) is defined then by definition the unique action a = g(v,m) is enabled at v. Hence, there 
exists (v',m r ) G V' such that (v,m) — > (y',m r ). 



Thus taking the product of the arena with a bounded memory strategy a, of player i does the follow- 
ing. For a vertex v G V,, it retains only the outgoing edge that is labelled with the action specified by the 
corresponding memory state of a,. For all other vertices v £ V,, it retains all the outgoing edges. 

Proposition 2 Le£ vo) &e an arena ana" <7i , . . . , a n be bounded memory strategies. Then x x 
... x £/ 0n is an arena, that is, there are no dead ends. 

4.2 Equilibrium 

Of the n players let the first r be optimisers and the rest n — r be imitators. Let T r+ i, . . . , T„ be the 
types of the imitators r+l,...,n. We transform the game (Sf,vo, . . . , -< ra ) with « players to a game 
(Sf',Vo, . . . , ^+i) w i tn r + 1 players in the following steps: 

1. Construct the graph (&', v(,) = ((V",£')> v d) as ^' = & x x ' ' ' x ^t„- 

2. Let V' = V;u...UV;u V r ' +1 such that for i : 1 < i < r, (v,m u .. .,m n ) G Vj iff v G V,-. And 
(v,mi , . . . , m„) G iff v G V r+ \ U . . . U V n . Let there be r + 1 players such that the vertex set 
Vt belongs to player i. Thus we introduce a dummy player, the r+ 1th player, who owns all the 
vertices (v,m 1; . . . ,m„) G V such that v was originally an imitator vertex in V. By construction, 
we know that every vertex (v,m\,. . .,m n ) G V/ +1 has an unique outgoing edge (v,mi, . . . ,m n ) — >■ 
(v',mi)-", m Ii)- Thus the dummy player r + 1 has no choice but to play this edge always. He has 
a unique strategy in the arena at every vertex of V r ' +1 , play the unique outgoing edge. 

3. Lift the preference orders of the players 1 to r to subsets of V' as follows. A subset W of V' 
corresponds to the Muller set F(W) = {v \ (v,m r+ i, . . . ,m n ) G W} of . For every player i : 1 < 
i < r, for W,W C V', W W if and only if 

Since the player r + 1 has a unique strategy and plays it always, his preference ordering doesn't 
matter in the game. However, for consistency, we assign the preference of an arbitrary imitator (say 
imitator n) in the game ,vo, -<i, . • . , -< n ) to the r+ 1th player in the game (&',v' Q , -<[,..., -<' r+ i). 
That is, for W, W C V, W -<' r+x W if and only if F(W) < n F(W). 



□ 
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The game (W,v' , -<[, . . . , ~<' r , j) is a turn based game with r+l players (the optimisers and the 
dummy) such that each player i has a preference ordering over the Muller sets of V '. Such a game 
was called a generalised Muller game in ifTTTl . 

Let L be the set 

L = {/€(V'u{tt})l y 'l +1 | \l\ t = lA\/veV' (|/| v = l)} 
where |/| v denotes the number of occurences of v in /. We have 

Theorem 1 ( 1111 ) The game (&',VQ,-<\,...,-<' r+ i) has a Nash equilibrium in bounded memory strate- 
gies, the memory being L. 

Now let a' = (a{, . . . , o' r , c^+i) b e a Nash equilibrium tuple for r+ 1 players in the game (&',v' , -<\ 
-<' r+ i)- We now construct a bounded memory imitation equilibrium tuple a for the r optimisers in 
the game (^> ,^i,...,-<„). 

For the optimiser i : 1 < i < r, let a- = (L, 8',g',lj). Define a, = (M, 8,g,lj) to a bounded memory 
strategy in the game , vo , -<i ,...,-<„) as 

• M = M r+ \ x ... x M n x L where M,-, r + 1 < j < n is the memory of strategy T, of imitator i. 

• 8 : A x M — > M such that 8 (a, (m r +\, . . . ,m n ,l)) = (m' r+l ,. ■ -m' n , 8' (a, I)) where m\ = 8i(a,mi), 
r+ 1 < i < n such that 5,- is the memory update of strategy T,-. 

• g:V xM such that g(v,(m r+ i,...,m n ,l)) = g'((v,m r+l , . . . ,m n ),l). 

,lj) where m'j, r+l < i < n is the initial memory of strategy 

We then have: 

Theorem 2 a = (Oi , . . . , a r ) w a« imitation equilibrium in (Sf , vq, -<i , • • ■ , 

Proof Suppose not and suppose player i has an incentive to deviate to a strategy /I in (^,vq,-<i, . . . , -<!„). 
Let m € A m be the unique play consistent with the tuple a where the imitators stick to their strategy tuple 
(t r+ i , ...,T n ). Let u' G A m be the unique play consistent with the tuple (<?_/, /x) (that is when player / has 
deviated to the strategy /i) where again the imitators stick to their strategy tuple (T r+ i, . . . , T„). Let I be 
the first index such that u(l) ^ u'{l). Then, vo[m/-i] € V{, (where m/_i is the length I — I prefix of u). That 
is, the vertex vo[«/-i] belongs to optimiser i since everyone else sticks to her strategy. 

Now consider what happens in the game (&',v' , -<' r+1 , ■ ■ ■ , -<' n ) when all the optimisers except i play 
the strategies a[,..., ct/_j, . . . , a/ +1 , . . . , o' r and the imitators stick to their strategy tuple (x r +i , ■■■,%)• If 
the optimiser i mimicks strategy /I for / — 1 moves in the game then the play is exactly w/_i and reaches a 
vertex (v,m r+ i, . . . ,m„) € V/ where v = vo[«;-i]. By construction of the product, all the actions enabled 
at v in the arena W are also enabled in the arena W. Hence the optimiser i can play u{l). By similar 
arguments, optimiser i can mimick the strategy /I in the arena C S I forever. 

Thus by mimicking % in the game (W,v' , -«y+i> • • • > "^b)> trie optimiser i can force a more preferable 
Muller set. But this contradicts the fact that a' is an equilibrium tuple in the game (^',Vq, ~<' r+ i, • • • , 
□ 

4.3 Stability 

Finally, we adress the questions asked in Section[TJ Given a game vq, -<i, . . . , -< n ) with optimisers and 
imitators where the optimisers play bounded memory strategies and the imitators play imitative strategies 
specified by k finite state transducers we wish to find out: 
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• If a certain stongly connected component W of 5f is where the play eventually settles down to. 

• What subtypes eventually survive. 

• How worse-off is imitator i from an equilibrium outcome. 

We have the following theorem: 

Theorem 3 Let (£f,vo, -<i, • • • , -< n ) be a game with n players where the first r are optimisers play- 
ing bounded memory strategies 0\ , . . . , O r and the rest n — r are imitators playing imitative strategies 
T r+ \ ,T n where every such strategy is among k different types. Let W be a strongly connected compo- 
nent off£. The following questions are decidable: 

( i) Does the game eventually settle down to W ? 

(ii) What subtypes of the k types eventually survive? 

(Hi) How worse-off is imitator ifrom an equilibrium outcome? 

Proof Construct the arena (&',v' ) x &j a ^ x . . . x s&o, x ^ Tr+] x . . . x 

(i) For the strongly connected component S in ,v' ) that is reachable from Vq, let S be subgraph 
induced by the set {v [ (y,mi,... ,m n ) £ S'}. Collapse the vertices of 5 that have the same name 
and call the resulting graph S". Check if S" is the same as W and output YES if so. 

(ii) For the strongly connected component S in (Sf' ,Vq) that is reachable from Vq do the following: 

• For i : r+l < i <n take the restriction of S to the z'fh component for every (v,mi, . . . ,m n ) € 5. 
Let 5, denote this restriction. 

• Collapse vertices with the same name in Sj. Let be this new graph. 

• Check if 5- is a subtype of a,. If so output Sj. 

(iii) Compute a Nash equilibrium p. of the game (5f,vo, -<i, . . . , -< n ) using the procedure described in 
ifTTTl . Let S' be the reachable strongly connected component of the arena (W,v' ). Restrict S' to the 
first component and call it S. Let F = occ(S). Compare F with inf(p jQ ) according to the preference 
ordering <i of imitator i. 

□ 

4.4 An Example 

Let us look at an example illustrating the concepts of the previous section. Consider 3 firms A, B and 
C. Each firm has a choice of producing 2 products, product a or product b repeatedly, i.e., potentially 
infinitely often. In every batch each of them can decide to produce either of the products. 

Now firm A is a large firm with all the technical knowhow and infrastructure and it can change 
between its choice of production in consecutive batches without much increase in cost. On the other 
hand, the firms B and C are small. For either of them, if in any successive batch it decides to change 
from producing a to b or vice-versa, there is a high cost incurred in setting up the necessary infrastructure. 
Whereas, if it sticks to the product of the previous batch, the infrastructure cost is negligible. Thus in 
the case where it switches between products in consecutive batches, it is forced to set the price of its 
product high. This actually favours firm A as it can always set its product at a reasonable price since it is 
indifferent between producing either of the two products in any batch. 

The demand in the market for a and b keeps changing. Firm A being the bigger firm has the re- 
sources and knowhow to analyse the market and anticipate the current demand and then produce a or b 
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Figure 2: The arena £f 



accordingly. Also assume that firm A is the first to put its product out in the market. Thus it is tempting 
for firms B and C to imitate A. But in doing so they run the risk of setting the prices of their products too 
high and incurring a loss. 

We model this situation in the form of the arena shown in Figure 2 where the nodes of firm A, 
B and C are denoted as 0> a an d A respectively. The preferences of each of the firms for the relevant 
connected components when the market demand is low are given as: 

{1,2,3,4,5,6} > A X, forXC {1,2,3,4,5,6} 

{1,3,5} > B {1,4,5} > B {1,3,5,4} > B {2,3,6,4} > B Y, for any other Y C {1,2,3,4,5,6} 

{1,3,5} > c {1,4,5} > c {2,3,6,4} > c {1,3,5,4} > C Z, for any other Z C {1,2,3,4,5,6} 

Thus firm A prefers the larger set {1,2,3,4,5} to the smaller ones while B and C prefer the smaller sets. 
But when the market demand is high their preferences are given as: 

{1,2,3,4,5,6} >iX, for* C {1,2,3,4,5,6} aadie{A,B,C} 

That is, all of them prefer the larger set. 

Now if A produces a and b in alternate batches and B and C imitate A, then we end up in the 
component {1,2,3,4,5,6} which is profitable for A but less so for B and C when the market demand is 
not so high. But when the demand is high, the component {1,2,3,4,5,6} is quite profitable even for B 
and C and thus in this case, imitation is a viable strategy for them. 

5 Discussion 

The model that we have presented here is far from definitive, but we see these results as early reports in a 
larger programme of studying games with player types. The model requires modification and refinement 
in many directions, being addressed in related on-going work. In games with large number of players, 
outcomes are typically associated not with player profiles but with distribution of types in the population. 
Imitation crucially affects such dynamics. Our model can be easily modified to incorporate distributions 
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but the analysis is considerably more complicated. Further, it is natural to consider this model in the 
context of repeated normal form games, but in such contexts almost-sure winning randomized strategies 
are more natural. A more critical notion required is that of type based reduction of games, so that analysis 
of large games can be reduced to that of interaction between player types. 
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